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Abstract: Despite the widespread use of tlie apnea-hypopnea index in research, its scientific 
and statistical properties have not been examined thoroughly. The index may be viewed either 
as a rate (number of events per hour of sleep) or as a ratio of two variables (number of events/ 
number of hours of sleep). When considered as a rate, the apnea-hypopnea index may be mod- 
eled as the dependent variable, provided that researchers explicitly state which physical property 
they assume to be measuring. On the other hand, the index is rarely, if ever, the preferred model 
of exposure to sleep-disordered breathing (an independent variable), regardless of whether it is 
considered a rate or a ratio variable. Continued indiscriminate use of the apnea-hypopnea index 
in sleep research should be discouraged. 
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Introduction 

"We feel that the use of an 'Apnea Index,' ie, the number of apneas per sleep-hour, 
frequently gives a better indication of the seriousness of the disorder."' The term 
"apnea index" was probably coined in this quote, a parenthetical remark embed- 
ded in the second page of a book that was published in 1978.' Apart from one 
misgiving,^ the apnea index was endorsed by research groups in the early 19808,^"** 
but was later replaced by the apnea-hypopnea index (AHI), the average num- 
ber of apneas and hypopneas per hour of sleep.'"" Since 1983, the AHI has also 
been popularized under another self-explanatory name, the respiratory disturbance 
index." 

Both indices proved helpful in clinical practice, helping to diagnose sleep apnea 
and make treatment decisions. Clinical measures, however, are not necessarily the 
preferred measures in biomedical science, where modeled variables should accord 
with sound research methodology. In retrospect, sleep researchers have adopted the 
AHI and its predecessor too quickly, skipping a formal appraisal of the two variables. 
Only recently do we encounter dissenting writers who question the wisdom of using 
the AHI in research and suggest numerous alternative measures.'^""' This commen- 
tary offers a methodical analysis of the scientific merit of the AHI, and questions the 
indiscriminate use of this variable in research. 

First question: what are we counting? 

Paraphrasing a familiar idiom on apples and oranges, the merit of counting often 
depends on the answer to the following question: Are we counting apples alone 
and oranges alone, or are we counting a mix? Whichever definition of a respiratory 
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event is used, it is inconceivable that all apneas and all 
hypopneas are the same kind of exposure, as far as effects 
are concerned. For instance, the clinical consequences of 
not breathing for 60 seconds cannot be identical to the 
consequences of shallow breathing for 30 seconds, and the 
consequences of accompanying hypoxemia surely depend on 
its magnitude and duration, not on an arbitrary cutoff point. 
Not surprisingly, the hypopnea definition affects the count 
of hypopneas," yet no agreed-upon definition will solve 
the underlying problem: all kinds of apples and all kinds 
of oranges are counted identically in one overly simplified 
variable. Of course, no alternative variable is free of weak- 
nesses either, but deficient measures can still be ranked as 
better and worse. 

Setting that issue aside, how well do we count apneas and 
hypopneas? Judging from the literature, the answer seems 
variable, although several difficulties should be mentioned. 
The AHI depends on physiological and technical variables 
including sleep position, sleep stage, (rapid eye movement 
[REM] versus non-REM), scoring rules, and device-related 
parameters, such as sensitivity, calibration, and the type 
and quality of the signals. ' '^ '^"^^ Interestingly, in the era of 
digital technology, much of the scoring is still done manu- 
ally, leaving the door open for human error Yet all of these 
counting-related issues are the prelude for deeper method- 
ological pitfalls of the AHI. The first insight may be gained 
by a closer look at the name of the variable. 

The AH I: an index or a rate? 

Rate is a well-defined term in mathematics, denoting the 
ratio of the change in one quantity to the change in another. 
In epidemiology and medicine, the denominator of the rate 
is usually the passing of time, which can be counted in years 
(cancer rates), days (case fatality rates), hours (albumin 
excretion rate), minutes (heart rate), or even seconds (forced 
expiratory volume in one second). Some rates quantify the 
change in discrete variables (eg, number of heart beats), 
whereas others quantify the change in continuous variables 
(eg, volume of exhaled air). Although there are several kinds 
of rates, ie, population rate, individual rate, instantaneous 
rate, hazard rate, and average rate, all of them include 
some quantity of interest in the numerator and time in the 
denominator 

The acronym AHI contains the word "index", but this 
measure is not some human-made index like the Standard and 
Poor's 500; it is actually a rate. More precisely, the AHI may 
be written as the average of the hourly rates of a respiratory 
event, as shown next. 



Suppose a person sleeps A: hours where k is an integer, and 
denotes the number of respiratory events during that time. 
Let n. denote the number of events in the /-th hour of sleep 
(/= 1 ,2, ... k), such that N= But the number of events in 
the i-th hour (« .) is related to the rate of events in the i-ih hour 
(denoted i?.) as follows: n. events/1 hour = i?. events/hour So 
n. = R., and, therefore, AHI = Nlk=QLn)lk= OLR)lk, which 
is the average of k hourly rates. 

There is nothing sacred about dividing time into hours. If 
sleep duration is divided into q intervals of 1 0 minutes, rather 
than k intervals of one hour, another AHI (per 10 minutes) 
may be written as the average of q event rates in 1 0-minute 
intervals, (ljR.)/q. We cannot, however, shrink the time 
intervals infinitely and consider an average of time point 
(ie, instantaneous) rates. The instantaneous rate is a deriva- 
tive of some quantity (7) as a function of time (t), but Y=f[t) 
is not differentiable when Y is the count of events by time t 
(because it is not a continuous function). The instantaneous 
rate of a respiratory event does not exist. 

Rates: neither causes nor effects 

We distinguish between two kinds of variables: natural 
variables and derived variables. The former describe proper- 
ties of objects at discrete time points (eg, body mass, height, 
and waist circumference), whereas the latter are made up by 
mathematics (eg, body mass index and waist-to-hip ratio). 
Natural variables make up the causal structure of the natural 
world; derived variables do not. Natural variables are causes 
and effects of other natural variables; derived variables 
are not.^^ 

Rates are not natural variables, and the average rate 
of a respiratory event is no exception. The AHI is not 
caused by anything and does not affect anything; there is 
no causal parameter behind the association of the AHI with 
any variable. To think that the AHI itself (the product of 
mathematics) affects the level of sleepiness is analogous to 
thinking that body mass index, also a product of mathematics, 
affects the level of blood glucose. Both are examples of 
thought bias,^'"^^ which is ubiquitous in science. And as harsh 
and scary as the last claim might sound, it is solidly true and 
yet to be challenged. 

Are all rates, therefore, useless? Not at all. First, many 
rates, such as the mortality rate or the rate of lung cancer, do 
not claim to be more than what they are, ie, the frequency of 
some event. Second, some rates may also be used to impute 
the unknown values of natural variables of interest. For 
instance, airflow (liters per second) serves to measure the 
speed at which air moves, and the forced expiratory volume 
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in one second provides information about airway obstruction, 
which is actually the airway diameter. Likewise, the heart rate 
tells us not only about the frequency of a heartbeat, but also 
about the condition, or location, of the governing pacemaker. 
Derived variables, including rates, may sometimes serve as 
substitutes for natural variables,^' provided we explicitly state 
what they are supposed to measure. 

The AH I substitutes for ... what? 

Scientific inference thrives on explicit, precise assumptions, 
which may be false, but should not include the apologetic 
phrase "do not really know". Scientists are allowed, perhaps 
even encouraged, to remain uncertain about whether A is 
a cause of B, but they may not declare uncertainty about 
what A and B stand for.^' A measurement in science must 
be coupled with an explicit theory of what is assumed to 
be measured, because a measured variable is merely a 
substitution for some variable of interest. ^'-^^ Yet all too 
often a measurement takes on a life of its own, absurdly 
becoming the focus of the inquiry. In retrospect, that was 
the case with the AHI. For the most part, researchers did 
not bother to state what exactly is being measured by that 
index. What does the average rate of a respiratory event 
substitute for? 

If probed about the causal variable of interest, many 
researchers would probably answer "severity of sleep- 
disordered breathing" or some related idea, such as "the 
level of physiological stress" or "the tendency of the upper 
airway to collapse" (in obstructive events). "Severity", 
"stress", and "tendency", however, are not well specified 
natural variables. They are abstract nouns. If anything, the 
AHI must substitute for some physical property: upper 
airway pressure, upper airway diameter (in the obstructive 
type), excitation of respiratory muscles (in the central type), 
and the volume of inhaled air to name a few examples. That 
a single derived variable can substitute for more than one 
natural variable does not dismiss the duty of researchers 
to state which natural variable they claim to be measuring. 
Moreover, inexplicit measurements open the door to ridicu- 
lous inference, as illustrated next. 

Consider a linear regression model that an analyst 
might fit: 

Log(AHI + 0.1)= P„+PjNECK + p.V + e 

where NECK is neck circumference, V denotes a vector 
of all so-called covariates, and p denotes a vector of their 
coefficients. 



Without an explicit theory of what is being measured by 
the AHI and NECK, a reader may claim that both variables 
substitute for upper airway pressure at a given moment. If 
so, Pj in this model is an absurdity, ie, an attempt to esti- 
mate the effect of a variable on itself On the other hand, 
if the researchers state that neck circumference substitutes 
for the volume of fat around the upper airway and the AHI 
substitutes for subsequent upper airway diameter, they may 
claim that Pj estimates the effect of the former on the latter. 
As that example shows, a regression model can correspond 
to two theories, one logical and another absurd, but a model 
may not correspond to "We are not sure what we have mea- 
sured". To model a variable without an explicit theory of 
what is assumed to be measured is a mathematical exercise, 
not science,^'''^' because no one can corroborate or negate an 
unspecified causal theory. 

The AHI: a good measure 
of exposure to sleep apnea? 

If we are interested in the causes of sleep apnea, the AHI 
may be the dependent variable, substituting for a specified 
physical property as illustrated in the last example. On the 
other hand, the AHI is rarely, if ever, the preferred choice 
whenever we study the possible consequences of sleep 
apnea, such as sleepiness, stroke, blood pressure, and death. 
Several lines of reasoning argue against modeling the AHI 
as an independent variable. 

First and foremost, the hallmark of sleep apnea, whether 
obstructive, central, or mixed, is reduced volume of inhaled 
air (which might lead to hypoxemia, sympathetic surges, and 
microarousals). That type of exposure calls for modeling 
assumptions, and it is unclear why the average frequency of 
a respiratory event is a better model than the number (and 
type) of respiratory events, the total duration of respiratory 
events,'^ or other proposed measures.'''""' By analogy, which 
variable better captures exposure to cigarette smoke while 
awake: the rate of cigarette smoking or the total number of 
cigarettes smoked? Why do researchers prefer to model, for 
instance, pack-years of smoking rather than a smoking dis- 
turbance index, ie, the average number of cigarettes smoked 
per hour of wakefulness? Any logic behind a preference for 
the respiratory disturbance index in sleep research should 
have carried to the smoking disturbance index in smoking 
research. 

Second, modeling the AHI leads to excessive informa- 
tion bias that can be avoided by using alternative variables. 
Consider two people who share the same value of the AHI, 
say 10, one of whom usually sleeps 7 hours each night 
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whereas the other usually sleeps 5 hours. The AHI classifies 
them as "equally exposed", which is obviously wrong when 
we study the effects of repeated exposure to sleep apnea. In 
contrast, the total number of respiratory events classifies 
these people differently (70 versus 50), possibly ranking 
them, for example, according to the volume of inhaled air 
during sleep. If researchers think that the interval between 
successive events might play the role of an effect modifier,^* 
they may fit an appropriate model; for example, include 
a product term that contains "event-free sleep duration". 
As already discussed, distinguishing between apneas and 
hypopneas and taking event duration into account should 
be considered as well. 

Third, sleep duration is not only the denominator for 
the rate of a respiratory event, but is also a variable that 
substitutes for whatever sleep is supposed to achieve. 
Since sleep and sleep apnea share some effects of interest 
(eg, daytime sleepiness), the logic of putting sleep dura- 
tion in the denominator of the AHI is far from clear. If the 
reasoning appeals to confounding, that is plainly wrong. 
One theoretical exception aside,^' confounding bias is not 
removed by dividing the exposure variable (the number 
of respiratory events) by the confounder (sleep duration). 
Rather, sleep duration should enter the model as a covariate. 
Furthermore, dividing one variable by another is often a 
bad idea in research. 

The AHI: a ratio of two variables 

As we just saw, the denominator of the AHI plays a dual role. 
On the one hand, it is the usual time interval for rate com- 
putation, but on the other hand it is a variable that captures 
sleep duration per sc. Given the latter role, we may view 
the AHI not only as a rate that substitutes for some physical 
property, but also as a ratio of two variables, ie, the number 
of respiratory events (N) divided by sleep duration (S). 
Informally, N may be some measure of sleep apnea during 
the night and S may be some measure of "battery charge" 
upon awakening. Is there any justification for modeling the 
ratio of two variables as an independent variable? May we 
fit models such as the following? 

Systolic blood pressure = (3„ + (3, N/S + p • V + e 
Log (death rate) = p„ + (3^ N/S + p • V 

To answer this question, we should first ask ourselves 
what we have in mind when we model N/S in these equations. 
If we claim that N/S substitutes for a natural variable, and 
name it, we are partially safe. The ratio may be considered an 



imputation of the unknown values of that variable, analogous 
to any measurement.^^ Of course, the substitution may still 
be criticized for having information bias, as argued in the 
previous section. 

In all other circumstances, modeling a ratio such as N/S 
is unjustified because the model entails excessive bias or 
misspecification of causal relations. If only N (or whatever 
N is measuring) affects the outcome of interest, division 
by S adds information bias by creating a variable (N/S), 
the distribution of which differs from that of N. If only S 
(or whatever S is measuring) affects the outcome, the error 
might be compounded by unexplained modeling of 1/S, rather 
than S, on top of detrimental multiplication by N. And if 
both N and S affect the outcome, neither the effect of N nor 
the effect of S is estimated by the coefficient of their ratio 
(misspecification). At best, we may view the ratio N/S as an 
interaction term. For instance: 

Log (death rate) = Po + Pi N(l/S) -i- P • V 

But that does not solve the problem either. Interaction 
models, serving to estimate effect modification between 
two causes of some outcome,^** can be interpreted properly 
only when the components of the product term are included 
as well.^" If researchers hold a theory of effect modification 
between sleep apnea and sleep duration, they should fit the 
following model: 

Log (death rate) = P„ + Pj N + P^ S + P, NS + p • V 

Lastly, even under the paradigm of predicting the out- 
come, ignoring cause-and-effect relationships, modeling 
the ratio alone might be worse than modeling one of its 
components, or worse than modeling the ratio together with 
its components.'" 

Conclusion 

Although there is some justification for modeling the AHI 
as the dependent variable, modeling of the AHI as an inde- 
pendent variable is rarely justified. The number and type of 
respiratory events overnight, their total duration,'^ and other 
recently proposed variables'*""" are better models of exposure 
to sleep apnea. Interestingly enough, shortly before coining 
the term "apnea index",' the same first author preferred to 
report the total number, or duration, of nocturnal apneas,''''^ 
rather than their number per sleep-hour. What caused him to 
change his mind is unclear, but the effect of his casual remark 
lasted for many years. 
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Should previous studies that used the AHI as an exposure 
variable be declared useless? Not at all. In retrospect, we 
may (awkwardly) view the AHI as a measure of the total 
number of respiratory events, just as we may (awkwardly) 
view the body mass index as a measure of body fat. In both 
cases, we can save the model by admitting to have mistak- 
enly added some information bias by dividing the numerator 
(the preferred measure) by another variable. Moreover, we 
may argue that the bias is negligible or small, because sleep 
duration, S, does not vary greatly among many people, and 
N/S is highly correlated withN. Notwithstanding their saving 
of past work, post hoc arguments do not justify perpetuat- 
ing a recognized methodological mistake, regardless of its 
practical significance. 

Many of the arguments presented here extend to other 
ratio variables that are commonly used in sleep research: 
the arousal index, percentage of time in desaturation, and 
percentage of time in each sleep stage. In fact, the modeling 
of ratios has been harshly criticized in other branches of sci- 
ence as well.^'-^"-""^^ However, the lessons offered are far more 
general. First, a variable that is useful in clinical practice is 
not necessarily the preferred variable in biomedical science. 
Second, we should never accept a new variable into science 
on the basis of feelings or authority. Third, if the analyzed 
variable is a derived variable, researchers should explicitly 
state which natural variable they claim to be measuring. And 
if they are unwilling to commit to a clear theory,'^ they are 
not in the business of science.^' 
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